Monaural Speech Separation Using Dual-Output Deep Neural Network with Multiple Joint Constraint
Abstract
Monaural speech separation is a significant research field in signal processing. To achieve better separation performance, we propose three novel joint-constraint loss functions and a multiple joint-constraint loss function for monaural speech separation based on a dual-output deep neural network (DNN). The multiple joint-constraint loss function for the DNN model not only restricts the ideal ratio mask (IRM) errors of the two outputs, but also constrains the relationship between the estimated IRMs and the magnitude spectrograms of the clean signals, and the relationship between the estimated IRMs and the magnitude spectrogram of the mixed signal. The constraint strength is adjusted through parameters to improve the accuracy of the separation model. Furthermore, we solve for the optimal weighting coefficients of the loss function using an optimization approach, which further improves the performance of the separation system. We conduct a series of experiments on the GRID corpus to validate the superiority of the proposed method. The results show that, using perceptual evaluation of speech quality, short-time objective intelligibility, source-to-distortion ratio, source-to-interference ratio, and source-to-artifact ratio as metrics, the proposed method outperforms the conventional DNN method. Taking speaker gender into consideration, we carry out experiments on Female-Female, Male-Male, and Male-Female cases, which show that our method improves the robustness of the separation system compared with some previous approaches.
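The abstract describes the loss only in words, so the following is a minimal sketch of what a joint-constraint loss of this kind might look like, not the formulation used in the paper. It assumes a magnitude-domain IRM of the form |S1| / (|S1| + |S2|), mean-squared-error terms, and two hypothetical weights `alpha` and `beta`; the function names and the exact combination of terms are illustrative assumptions.

```python
import numpy as np

def ideal_ratio_mask(s1_mag, s2_mag, eps=1e-8):
    """Magnitude-domain IRMs for two sources (assumed definition)."""
    total = s1_mag + s2_mag + eps
    return s1_mag / total, s2_mag / total

def joint_constraint_loss(m1_hat, m2_hat, s1_mag, s2_mag, mix_mag,
                          alpha=0.5, beta=0.5):
    """Illustrative joint-constraint loss for a dual-output mask estimator.

    alpha and beta are hypothetical weights controlling constraint strength.
    """
    m1, m2 = ideal_ratio_mask(s1_mag, s2_mag)
    # Term 1: IRM errors of the two outputs.
    mask_err = np.mean((m1_hat - m1) ** 2) + np.mean((m2_hat - m2) ** 2)
    # Term 2: masked mixture should match each clean magnitude spectrogram.
    clean_err = (np.mean((m1_hat * mix_mag - s1_mag) ** 2)
                 + np.mean((m2_hat * mix_mag - s2_mag) ** 2))
    # Term 3: the two masked outputs together should rebuild the mixture.
    mix_err = np.mean(((m1_hat + m2_hat) * mix_mag - mix_mag) ** 2)
    return mask_err + alpha * clean_err + beta * mix_err

# Toy usage with random magnitude spectrograms (frequency bins x frames).
rng = np.random.default_rng(0)
s1 = rng.random((129, 20)) + 0.1
s2 = rng.random((129, 20)) + 0.1
mix = s1 + s2  # assumes magnitudes add, which holds only approximately
m1_hat = np.clip(rng.random((129, 20)), 0.0, 1.0)
m2_hat = 1.0 - m1_hat
print(joint_constraint_loss(m1_hat, m2_hat, s1, s2, mix))
```

In this sketch, setting `alpha` and `beta` to zero recovers a plain mask-error loss, so the two weights play the role of the adjustable constraint strength mentioned in the abstract.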
Similar resources
Deep Recurrent Neural Network Based Monaural Speech Separation Using Recurrent Temporal Restricted Boltzmann Machines
This paper presents a single-channel speech separation method implemented with a deep recurrent neural network (DRNN) using recurrent temporal restricted Boltzmann machines (RTRBM). Although deep neural network (DNN) based speech separation (denoising task) methods perform quite well compared to the conventional statistical model based speech enhancement techniques, in DNN-based methods, the te...
Deep Ensemble Learning for Monaural Speech Separation
Monaural speech separation is a fundamental problem in robust speech processing. Recently, deep neural network (DNN) based speech separation methods, which predict either clean speech or an ideal time-frequency mask, have demonstrated remarkable performance improvement. However, a single DNN with a given window length does not leverage contextual information sufficiently, and the differences be...
Monaural Speech Separation
Monaural speech separation has been studied in previous systems that incorporate auditory scene analysis principles. A major problem for these systems is their inability to deal with speech in the high-frequency range. Psychoacoustic evidence suggests that different perceptual mechanisms are involved in handling resolved and unresolved harmonics. Motivated by this, we propose a model for monaura...
Two-stage multi-target joint learning for monaural speech separation
Recently, supervised speech separation has been extensively studied and shown considerable promise. Due to the temporal continuity of speech, speech auditory features and separation targets present prominent spectro-temporal structures and strong correlations over the time-frequency (T-F) domain, which can be exploited for speech separation. However, many supervised speech separation methods in...
Neural Dual Extended Kalman Filtering: Applications in Speech Enhancement and Monaural Blind Signal Separation
The removal of noise from speech signals has applications ranging from speech enhancement for cellular communications, to front ends for speech recognition systems. A nonlinear time-domain method called dual extended Kalman filtering (DEKF) is presented for removing nonstationary and colored noise from speech. We further generalize the algorithm to perform the blind separation of two speech sig...
Journal
Journal title: Chinese Journal of Electronics
Year: 2023
ISSN: 1022-4653, 2075-5597
DOI: https://doi.org/10.23919/cje.2022.00.110